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Abstract. Compressed sensing (CS) schemes are proposed for monostatic as well as syn- 
thetic aperture radar (SAR) imaging with chirps. In particular, a simple method is developed 
to improve performance with off-grid targets. Tomographic formulation of spotlight SAR 
is analyzed by CS methods with several bases and under various bandwidth constraints. 
Performance guarantees are established via coherence bound and the restricted isometry 
property. CS analysis provides a fresh and clear perspective on how to optimize temporal 
and angular samplings for spotlight SAR. 



1. Introduction 

Advances in compressed sensing (CS) and radar processing have provided tremendous 
impetus to each other. On the one hand, the two CS themes of sparse reconstruction 
and low- coherence, pseudo-randomized data acquisition are longstanding concepts in radar 
processing. On the other hand, CS contributes provable performance guarantees for sparse 
recovery algorithms and informs refinement of these algorithms. These and other important 



issues relevant to CS radar are thoroughly reviewed in 13 24] (see also the references therein). 



Target sparsity, a main theme in CS, arises naturally in radar processing. According 



to the geometrical theory of diffraction 21 , the scattering response of a target at radio 
frequencies can often be approximated as a sum of responses from individual reflectors. 
These scattering centers provide a concise, yet physically relevant, description of the object 



18 . A spiky reconstruction of reflectivity may thus be highly valuable for automatic target 



recognition. More generally radar images are compressible by means of either parametric 



models of physical scattering behaviors or transform coding 24 . 

In the present work, we consider point as well as extended targets. For point targets, a 
main drawback, however, of the standard CS framework often neglected by previous works 
is the reliance on a underlying grid. In reality, the dominant scattering centers can not be 
assumed to be positioned exactly at the computational grid points. Indeed, CS-algorithms 



often break down when the targets are located between grid points 13 16 . In order to reduce 
gridding error, the grid has to be refined, giving rise to higher coherence of the measurement 
matrix which is detrimental to standard CS algorithms. 

Can CS approach be extended to the case of arbitrarily located targets? Several approaches 
have been proposed to address this critical question [iJ[TTJ[T6|[T7] and all of them focus on 
algorithmic improvements. In this paper we propose a simple, alternative approach, which 
focuses on improving the accuracy of the measurement matrix, and establish performance 
guarantee for the refined system (Sections [3] and [4]). 

We then turn to the spotlight mode of synthetic aperture radar (SAR). We consider the 



tomographic framework of spotlight SAR as formulated in 22 for different classes of targets, 
including point targets, localized extended targets and distributed extended targets. These 
different targets require different sparsifying bases which in turn demand different sampling 



schemes. We propose several sampling schemes and establish performance guarantee for 
them (Section [5]). In particular, we examine the realistic case of high-frequency, narrow- 
band signals and derive an error bound. In each case, we show that CS can achieve the 
standard resolution limit with sparse sampling. Finally we present numerical experiments 
demonstrating our results (Section [6]) and draw conclusion (Section m). 

2. MONOSTATIC SIGNAL MODEL 

Let us first formulate the signal model for the case of a monostatic radar with narrow 
fractional bandwidth and colocated transmitter and receiver antennas. A complex waveform 
/ with the carrier frequency uq is transmitted. Let r and v denote the range and the radial 
velocity, respectively. We parameterize the complex scene by the reflectivity function p(r, u) 
where the delay r = 2r/co is the round-trip propagation time and u = 2vuq/cq is the Doppler 
shift. Under the far-field and narrow-band approximations [7], the scattered signal is given 
by 

(1) y(t) = JJx(r,u)f(t~T)e- 2wiut dudT + w(t) 

where 

x( T ,u) =p(r,M)e-" iur 
and w(t) represents the circular white complex Gaussian baseband noise. 
(1) For immobile targets, p(r,u) = p(r)5(u), eq. ([I]) becomes 



(2) y(t)= / p (r)f(t-r)dr + w(t). 

J —00 

For immobile point targets located at {r^}, eq. (J2J) becomes 

(3) y{t) = Y,Pkf{t-T k )+w{t). 

k 

(2) For moving point targets located at {r^} with Doppler shifts {ui}, we have 

p(r,u) = ^p M 5(r - r k )5(u - u{), 
k,i 

and 

(4) y(t) = - r k )e- 2 ™ lt + w{t). 

k,l 

For the transmitted signal, let It be the indicator function of duration [0,T] of transmis- 
sion. By far the most commonly used waveform is the linear FM chirp 

(5) / LC (t) = exp [27ri (y t 2 + w t) ] I T (t) 

owing to the simplicity in implementation. The bandwidth of linear chirp can be defined as 
B = aiT. 

Motivated by the low coherence of the Alltop sequences [j], we will also consider the 
quadratic FM chirp: 

(6) / QC (t) = exp [27ri (^t 3 + ^-t 2 + w t)] l r (t) 



whose bandwidth is given by B = a 2 T 2 + a±T. The Alltop sequences have been analyzed in 
the stylized CS-radar work in [19] . 

3. Immobile targets 

3.1. On-grid targets. With linear chirp and immobile point targets, the signal model is 
given by 

y^i) = Pk ex P [ 27ri ~ Tfc ) 2 + ^(tj - r k )^j 

k 

= hc{tj) 22 Pkfhc{-Tk) exp {-2-KiaiTktj} . 

k 

Suppose first that the targets are located exactly on the grid points. Let T k = kAr where 
At is the grid size and k — 1, . . . , n, n > m. Let 

Q 



(7) 
and 

Then with 



ai = TAr' QeN 
% =t j /Te [0,1]. 



(9) 
(10) 



y(tj 



exp [— 2%iaiTktj] = exp [— 2niBkATtj\ = exp [ — 27riQfctj] 

we can write the signal model as the linear system Y = FX + E where E = [w(tj)} G C m 
represents the measurement errors. The bandwidth is given by B = a{T = Q/ At. 

A main thrust of CS is the performance guarantee for the basis pursuit denoising (BPDN): 

X = argmin||Z||i, \\FZ -Y\\ 2 <e 

under the assumption of the restricted isometry property (RIP): 

a(l-^)||Z|| 2 < ||FZ|| 2 <a(l + 4)||Z|| 2 

for some constant a > and all fc-sparse Z where 5k is the k-th order restricted isometry 
constant. More precisely, we have the following statement j3 , 25 . 

Proposition 1. Let tj e [0, 1], j — 1, 2, . . . ,m be independent uniform random variables. If 
ill) — ^ 



TTt I 

> C7sln 2 slnnln- , (5 G (0, 1) 

mm p 



for some universal constant C and sparsity level s, then the random Fourier measurements 
[_i_e 27ri ^] ; = l, . . . , n , satisfy the RIP with 5 2s < V2 - 1 and the BPDN solution X 
satisfies 

X-X 



<c - 

2 \/S 



-X\l +C 1 ! \\E\ 



2 ' 



X-X 



< C \\X^ -X^ + d \\E\ 



for some constants ,C\, with probability at least 1 — f3. Here X^ is the best s-sparse ap- 
proximation of X. 
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According to the above result, the (normalized) sampling times should be chosen randomly 
and uniformly in [0, 1] and the number of time samples m needs to be sufficiently large (11 ). 
This result is valid as long as the point targets are located exactly on the grid points which 
is a stringent and often unrealistic assumption. 

3.2. Off-grid targets. In practice, the time delays {r k } may not sit exactly on the grid 
points {kAr}. The mismatch between the actual signal and the signal model creates the 
gridding error which leads to poor performance in CS reconstruction (8 , 16, 17 



To deal with this problem we can make the model more accurate to compensate for the 
gridding error. Let r k = kAr + with \£ k \ < 4r. Then 

Vfa) = ^2pkfhc(tj - r k ) 

k 

(12) = /Lo(ti) J> fe /Lc(-T fe )e- 27ri ^ 

k 
k 

Suppose that B max \^ k \ is small. Then we can write 

e-ariai^ = ^5 = g-wiBfc ^ _ 2mB£, k {tj - 1/2) + 0(B 2 max |£ fc | 2 ) 

Let Q = BAt E N be the resolution-time-bandwidth product (RTBP). With 

(13) Y 3 



y(tj) 



(14) X k = Pk f LC (-kAr - ^ k )e- mB ^ , X' k = -2mQ^ k X k , g, = A 

Zat 

(15) F jtk = exp [-2mQk%] , G j>k = (t 3 - l/2)F jk 
the linear system takes the form 

Yj = ^2 { F hkX k + Gj,kX' k ) + Ej 



or equivalent ly 

(16) Y=[F G] 



X 
X' 



+ E. 



After X and X' are solved from the system, we can estimate and {p k }, respectively, by 



6 



iArXL 



k 



2nQX k 
and 

X k e™ B ^ 
Pk ~ f LC (-kAT-£ k y 
It is generally difficult to establish RIP for matrices other than random partial Fourier 
matrices and random matrices of independently and identically distributed (i.i.d.) entries 
such as [F G]. An alternative approach is via the notion of mutual coherence [i. The mutual 
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coherence of a matrix A is defined by the maximum normalized inner product between 
columns of A: 



(17) 



MA) 



max 



l^il 



#3 ||Aj|| 2 ||^4il|2 



The lower the level of mutual coherence the better performance is with the system. 
The following lemma states a coherence bound for the system (16). 

Lemma 1. Suppose 2n < <5exp [K 2 /2\. Then the mutual coherence \x of the combined sensing 
matrix A = [F G] satisfies 



(18) 



fi<C 



V2K 1 
+ 



2nQ 



for some universal constant C , with probability greater than (1 — 5) 2 — 4e m / 18 . 

This lemma says that to reduce the mutual coherence of the sensing matrix one should 
increase the number of data and the RTBP. The proof of the lemma is given in appendix |Aj 

We then obtain an error bound from Lemma [T] in conjunction with the following result 
(Theorem 3.1 of pi). 



Proposition 2. Suppose the data is noisy and the £ 2 -norm of the noise is less than e. If the 
sparsity of X satisfies 



(19) 



1/ 1. 
S< 4< 1+ ^ 



then the BPDN solution X satisfies the error bound 



(20) 



X -X 



< 



4e 2 



2 ~ 1 - (4s - l)fi' 



Observe that suppA 7 " = suppX'. This property can be utilized in the greedy pursuit (such 
as orthogonal matching pursuit) as follows. A common stage for any greedy pursuit is to 
choose the index corresponding to the columns of the maximum coherence with the residual 
vector. Since X, X' have the same sparse structure, one may utilize a priori information: 
choose both k-th columns from F and G, and test the size of the projected vector from Y 
on the span of the two columns. 



The sparsity constraint (19) can be relaxed if the targets are randomly distributed in 



distance (time delay) as stated in the following proposition [5] . 

Proposition 3. Suppose the data noises are additive Gaussian noise of variance a 2 . Assume 
that the targets are uniformly randomly distributed and s-sparse such that 

< 21 > s * ^T^' 

|| A|| 2 logra 

Assume also 

Co 



(22) fi < 



logn 

5 



(23) min|X fc | > 8a V '2 logn 
i/ien the solution X of the LASSO 

(24) mm XaWZ^ + ^WY - AZ\\ 2 2 , A = 2^2 logn 

ofreys supp(X) = supp(X) with probability at least 1 — (9(n -1 ). 

Once the target support is exactly recovered the target strengths can be calculated by a 
simple least-squares step to obtain a nearly optimal reconstruction. 



On the one hand, (21) demands little on the mutual coherence. On the other hand, in 



order for (21) to improve over (19) we need a sufficiently strong spectral norm bound. We 



prove the following spectral norm bound in appendix |B} 



Lemma 2. The sensing matrix in (15) satisfies the spectral norm bound 



\A\\ 2 2 < In 



with probability greater than 

nt 1 \ \ m(m— 1) 

C (m — P x 



n 

where C is an absolute constant. 

Lemmas [TJ [2] and Proposition [3] imply that LASSO can achieve nearly optimal reconstruc- 
tion of randomly distributed targets whose number is on the order of the data number m, 
modulo a logarithmic factor. This is stated in the following theorem. 

Theorem 1. Suppose the data noises are additive Gaussian noise of variance a 2 . Assume 
that the targets are uniformly randomly distributed and s-sparse such that 

C\m 

s < 



2 logn 
Suppose that 

V2K < c 2 



y/m 2irQ C log n 



If (23) holds, then the LASSO (24) recovers the target support exactly with probability at 



least 1 - 0(n- 1 ). 



4. Moving targets 

Next we discuss the case of moving targets and include Doppler shift in the signal model. 

6 



4.1. On-grid targets. With / = fhc(t) we obtain the signal model from Q 

y(tj) = X^ ex P [ 27ri (y(*j ~ Tfc ) 2 + w ofe - T-fe))] e~ 2 ™^ 

= /Lc(*i) 5^ x k,ifLc(~n) exp [-2vri (aiT& + u t ) tj) . 

k,l 

Let r k = kAr, u\ = (I — n/2)Au with /c, / = 1, . . . , n. With 

Am 
1 nAr 

we set up the following grid {-y p } 

lp = Tk + ui/otx = (k + n(l - n/2))Ar = (p + n - n 2 /2)Ar 
where p = + n(i — 1) G {1, . . . , n 2 }. By choosing 

we can write the linear system Y = FX + E in terms of 

(25) y. = ^(^') c Tri(2n-n 2 )Qti 

ho(tj) 

(26) X p = x k> if LC (-r k ) 

(27) = exp [-27riQp5] e C mx " 2 

where, with the uniform random variables over [0, T], j = 1, 2, . . . , m, the sensing matrix F 
becomes the random partial Fourier matrix of size mxn 2 with which BPDN is guaranteed to 
perform according to Proposition [TJ Note with the above choice of parameters the bandwidth 
B = Q/ At is the same as in ([7]) . 

4.2. Off-grid targets. Suppose the actual time delays {r k } and Doppler shifts {u{\ are not 
exactly on the grid: r k = kAr + £ k , Ui — (I — n/2)Au + r/i. Eq. Q with linear chirp now 
becomes 

k,l 

k,l 

= hc(tj) 5>M/Lc(-fcAT - ^) e - 2 -(o !1 fcAr+(Z-n/2)A„) t3 . e -2 7 ri(a 1 ^ +J7! )^. 
k,l 



With At, Am, and 7 P as above, the approximation 

e -2^i(a l5fc +r ?; )t J = e -2^BC P t 3 _ g-KiBfc ^ _ 2^5^(5 " 1/2)) , C P = 6 + V«l 



for small B\( p \ now leads to (16) with 

hc{tj) 

(29) X p = Xk>l f LC (-kAr - t k )e-* iB ^ , X£ = -2iriB{ p X p 

(30) = exp [-27riQjrf}] , = (5 - l/2)F iP . 

The sensing matrix A = [F G] obeys the same coherence bound in Lemma [l] and thus has 
the performance guarantee given in Proposition [2j Once X and X' are reconstructed, we 
can compute {( p } by 

c - iX ' p 

Even though we are unable to determine and {rji} separately, the sparsity patterns 
of X and X determine the time delays and Doppler shifts up to the precision of At and 
Am, respectively. Moreover, the magnitude of the reflectivity can be recovered since \pk,i\ = 

\ x k,l\ = \X p \. 

4.3. Quadratic chirp. For purpose of comparison, let us the signal model ^ with / = /qo 
The received signal takes the form 

n 

— 2iriuitj 



f = Xk ' 1 exp 2?ri ~~ Tfc ) 3 + y ^' ~ Tk ^ 2 + ~ Tfc 

(31) = /qc(*j) ^2 x k,ifQc(-r k ) exp [2vri (Wj(r fc 2 - t 5 T k ) ~ tj(ptm - ui))] 



k,l=l 



Instead of randomly time sampling, we use uniform sampling tj = jAt, j — 1, 2, . . . , m with 
At = T/m. With the combined index p = k + n(l — 1) as before the linear system becomes 

(32) Yj - 



(33) X p = x k) ifQ C {-Tk) 

(34) F ip = exp [2ni (a 2 t j (-t j T k + r k ) - t j (a 1 r k - ui))] 

Lemma 3. With the on-grid targets r k = kAr, u\ = lAu, the uniform sampling tj = jAt,j 
1,2, ... ,m, as well as the parameters 

m 

(35) «2 = rp2^ (or its integer multiple) 

(36) oil = (or its integer multiple) 

(37) Am = — (or its integer multiple) 

(38) T = mAr 

m = n 



the mutual coherence o/Fe C nXn ^ n (34) becomes 

■2 

(39) Fj p = exp 



— 2?ri fc^ + (— ^ + A; + Z) 

n n 



p = k + n(l — 1) 



and satisfies the bound 
(40) 

for any prime number n > 2. 



MF)< 



n 



The proof of Lemma [3] is given in appendix [Cj Lemma [3] in conjunction with Proposition [2] 
then implies that up to 0(^Jn) targets can be recovered stably by BPDN with QC. As we will 
see in the numerical examples below, QC with random sampling and uniform sampling have 
essentially the same numerical performance even though we do not have the performance 
guarantee for QC with random sampling (Fig. [1] and |5|. 

Note that with the choice of parameters in Lemmap] the bandwidth becomes B = (n + 
1)/Ar. The coherence bounds for LC (Lemma [T]) and QC (Lemma [3]) are then comparable 
with Q = n + 1. And indeed, their numerical performances are similar to each other when 
their bandwidth are same (Fig. [4] and [5]) 



For off-grid targets we apply the same linearization technique with the parameters (35)- 



(38) and obtain the following linear system: 

y(tj) 



(41) 


Yi 


(42) 


X p 


(43) 


X p 


(44) 


F 


(45) 


G jp 


(46) 





x k ,i /qc ( - k At) e ^m k +^l) e ^mi k /2 e -2ni m ^t 3 e -^ k e -vi% 



■- -2ni£ k X p , Xp = -2nifjiX p , £ k = rji 

■- exp [2ni (tj(k 2 - mtjk) + tj(l - k)) ] 
= (tj - 1/2) (-2k + m(tj - 1/2) + l)F jp 
■■ (tj-l/2)F jp 

Because of the complexity of the system we are unable to prove the incoherence property for 



Au 



5. Compressive spotlight SAR 

In this section, we consider the spotlight synthetic aperture radar (SAR) for a stationary 
scene, represented by the reflectivity p(r) of spatial variables r only. For simplicity of the 
presentation, we focus on the case of two dimensions r = (ri,^)- The adaption to three 
dimensions is straightforward. 

In standard radar processing, the received signal, upon receive, is typically deramped by 



mixing the echo with the reference transmitted chirp 20 . Under the start-stop approxi- 
mation and a far-field assumption the deramp processing produces samples of the Fourier 
transform of the Radon projection, orthogonal to the radar look direction, of the scene re- 
flectivity multiplied by a quadratic phase term. Furthermore, if the time-bandwidth product 
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TB = a{F 2 is significantly larger than the total number n of resolution cells, the quadratic 
phase term can be neglected and the deramped signal can be written simply as 22 

(47) y(v(t),e) =T\p\{y(t) cos 6,v(t) sin^), u(t) = - (w„ + - r )) 

Co 

where J 7 stands for the 2-d Fourier transform, 9 is the look angle and To the round-trip travel 
time to the scene center. For a sufficiently small scene, u(t) is effectively restricted to 



(4* 



v G — [u ,ujo + a{T]. 

Co 



We will first discuss the ideal case of full spatial frequency band 
(49) v G [0, i/,] 



and consider the partial bandwidth case in Section |5.3| We will return to the narrow band 
case (48) with 

v Q = 2u /c > 2«iT/c 

in Section l5~4l 

5.1. OfF-grid point targets. Let the computation domain be the finite square lattice 
(50) C = {£(pi,p 2 ) ■ Pi,Pa = 1,-, Vn} ■ 

The total number of cells n is a perfect square. For the off-grid targets represented by 
p(r) = ^ Pp 5 ( r ^ h p)> h P = {hi Pi h^), \h lp \, \h 2p \ < 1/2 



the signal model (47) becomes 



y(y,6) = ^ Ppe -™-(P+ h p) 

PGZ 2 

where d = (cos 6, sin 6) denotes the direction of look. Following the same perturbation 
technique 

e -2^d.(p+h p ) = e -2Mud.p^ _ 2n i£ u d ■ h p ) + 0(\£ud ■ h p | 2 ) 

we consider the signal model 

(51) y(u,6) = ^p p e- 2 ^ ap (l-27ri^d-h p )+w(^,e) 

pel 2 

where w is the error. 

Let (cik,bk),k = l,...,m, be i.i.d. uniform random variables on [—1/2, 1/2] 2 . Set Vk,9k 
such that 

(52) iv k cos 6 k = o fc , lv k sin 8 k = b k 



and let Y k = yiyki^h)- F° r (52) to have a solution in (49) it is necessary that lv* > 1/2 or 



equivalently I > 1/(2^). In the small scene limit U* = Uq + a\T, the resolution limit 

(53) £ > 77 °° 

V ; ~ 4(w + aiT) 

is half of the standard resolution limit for passive radars owing to the two-way travel of the 
active radar signal. 
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Define the primary and secondary target vectors by 

Xi = p p , XI = -27ri£h p X h l = {p 2 -l)V^ + Pi 
where (X;) G C n , {X[) G C 2n . The approximate signal model takes the form (16) with 

F k = e - 2 ^ a fc-P, G k = e- 2wieuk2k - p is k d k 



which is the two-dimensional version of ( 30 ) . 

The coherence bound given in Lemma [1] holds true with a slight adjustment of the con- 
stants. Thus this system has the performance guarantee stated in Proposition [2j 

5.2. Localized target. Instead of point targets, we now consider extended targets that are 
sparse in the pixel basis. We pixelate the scene with n pixels of size £. The centers of the 
pixels are identified as the finite square lattice C. Let D p denote the p-th pixel, Vp G N 2 . 
Suppose the scene is represented in the pixel basis as 

P eN 2 



where I p is the indicator function of D p . Then (47) becomes 



(54) y(y, 6) = I 2 V ^-p cob g) ain^sing) £ ^ 



We shall reconstruct {pj} from eq. (54) with the measurement data y(vk, Ok), k = 1 



where v k is the spatial frequency corresponding to the look angle 9 k . 



Let Vkidk be determined as in (52) with i.i.d. uniform random variables (a k ,b k ),k 
l,...,m, on [-1/2, 1/2] 2 and 

2/(ffc, Ok) 



sine a k ■ sine bk 



Then we can rewrite (54) as the standard CS problem with a random partial Fourier matrix 
of two dimensions 

Yi. = J2 P 3 e 2 ^ a ^ +b ^\ j = (p 2 - 1) + Pi 
j 

which has the same form as the one for on-grid targets and thus the CS performance guar- 
antee given in Proposition [T] if the target is sufficiently sparse in the pixel basis. 

The performance guarantee can be extended to the case of piecewise constant targets if the 
&x minimization in BPDN is replaced by the total-variation (TV) minimization. A version 
of the performance guarantee for TV-minimization is given in [15] . 

5.3. Distributed target. Finally we consider the case of distributed targets which do not 
have a sparse representation in the pixel basis. Although the Fourier basis may seem at 
first a natural choice for representing such targets it is not suitable for our purpose since the 
scene of a sparse Fourier representation will yield a sparse set of data which will be missed 
with high probability by any sparse sampling scheme. 

Instead, we represent the reflectivity function in the Littlewood-Paley basis with the 
mother wavelet 

(55) ?/H r ) = (7r 2 ^ir2) _1 (sin (27rri) — sin (7rri)) • (sin (27rr2) — sin (71T2)) 
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which has the following Fourier transform 

< 56 > = 1 fthedte" 1 

The following set of functions 

(57) Vp,q(r) = 2-( pi+P2 )/ 2 ^(2- p r - q), p, q G Z 2 
with the notation 

2 p r = {2- pi ri ,2~ P2 r 2 ) 

forms an orthonormal basis in L 2 (IR 2 ) [To]. In the Littlewood-Paley basis we have the 
expansion 

(58) p(n,r 2 ) = Pp,q^p,q(n,r 2 ) 

for any square integrable function. 

Let dfc = (cos#fc,sin#fc), k = l,...,m be the direction of look and let vj. be the spatial 
frequency corresponding to the look angle 9k- Taking the Fourier transform we obtain from 
(g that 

(59) y(u k ,9 k )=2n £ 2^ + ^)/ 2 p p , q e- 2 ^ 2Pafc ^(z/ fc 2Pd fc ), fc = l,...,n. 

P ,qez 2 

Now we describe how to select (9 k , Vk), k = 1, . . . , m. First we define some notation. Let 
p-(i,i) 

I = (2nj + l) 2 + (gi + n p )(2n p + 1) + (q 2 + n p + 1), (qj^ < n p , |p|oo < P*, 

j=(-p»i-p») 
p'-(i,i) 

fc = (2mj + l) 2 + (g' x + m p /)(2m p / + 1) + (q 2 + m p / + 1), |q'|oo < m p >, |p'|oo < V* 

j=(-p*,-p.) 

be the column and row indexes, respectively, of the sensing matrix (see below) for some 
m p ,n p which are the numbers of row and columns, respectively of, the block p (see below) 
and p» 6N which determines the maximum number, 2p* + 1, of dyadic scales. It is important 
to keep in mind the relationship between k, I and (p', q'), (p, q) in order to follow the scheme 
described below. 

The main idea is to select (8k,Vk),k = 1, . . . , m such that the sensing matrix becomes 
block-diagonal with each block being a random partial Fourie_r_ matrix in two dimensions 

To this end, z^d^. should 
i>e realized by the following 



which has the CS performance guarantee given in Proposition [l 
be randomly and uniformly distributed on [— 1/2, 1/2] 2 . This can 
scheme which is a modification of the sampling scheme presented in 14 

For fixed p' = (p[,p 2 ) G Z, 2 , let (ak,bk) be independent, uniform random variables on 
[-1/2, 1/2] 2 and define 

' 1/2 + a k , a k e [0, 1/2] 
-1/2 + a k , a k G [-1/2,0] 

1/2 + b k , b k e [0,1/2] 
-l/2 + & fc , bk G [—1/2, 0] 
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ak 



bk 



Clearly, 
(60) 

Let v k , 9 k such that 

(61) 
(62) 



a k ,b k G [-1, -1/2] U [1/2,1]. 

v k 2 p ' 1 cos 9 k = a k 
VkF'i sm9 k = b k . 



The constraint (60) then implies 

(63) u k > max (2~ p i- 1 , 2-^-i) 

meaning that the spatial frequency must be greater than half of the inverse of the resolved 
scale. Altogether, (60)-(62) implies that for the Littlewood-Paley scheme to work for any 
specific scale, we need about 50% bandwidth instead of the full bandwidth such as given in 



(64) 



Define the sensing matrix A = [Ak,i] with 

1 



.4 



k.i 



2n D + l 



3 -sgn(a fc )7ri<ji -sgn(b fe )7ri<j 2 



iP{v k 2 p d k )e 



-27riv fc 2Pd fc -q 



By (56), (61) and (62) it is clear that A^i are zero if p ^ p'. Indeed, we have 
^(z/ fc 2 p d fe )e- 2 ^ 2Pa - q = ^(2 p - p '(a k ,b k ))e 2 ^ P '^ b ^ 

which vanishes if p' p thanks to the support constraint of ip. Consequently the sensing 
matrix is the block-diagonal matrix with each block (indexed by p) in the form of random 
Fourier matrix 

(65 > **> = ^TTT' 



2ni(q 1 a k +q 2 b k ) 



where sgn(afc) and sgn(6fc) are, respectively, the signs of a k and b k . The random phase term 
e sgn(£fc)mgi e sgn(r? fc )7ng 2 can f ur ther absorbed in the definition of data and plays no role in 
the performance analysis. The rest of the sensing elements form a random partial Fourier 
matrix. 

Let X = (Xi) E C n be the target vector with 

Xi = 27r(2m p + i)2(Pi+P2)/ 2 p Piq e s g n (^)^i e s g n ( fcfc ) 7ri ' ?2 

and 

h=-p* fa=-p* 

The block-diagonal structure of A implies that the target structures of different dyadic scales 
are decoupled and can be determined separately by CS techniques. 

Under the assumptions of Proposition [T] we obtain the following theorem. 



Theorem 2. Suppose that for a given p, |p| < p* ; (11) is satisfied for n p ,m p and the target 
sparsity s p on the scale 2 P . Suppose 



(66) 



y 2max(— pi— 1,— pa— 1) 



and that the data are contaminated by noise of £ 2 -norm e t 



13 



Let ai, bi be independently and uniformly distributed on [—1/2, 1/2]. Let the look directions 
and the spatial frequencies be determined by (61) -(62). Then the BPDN solution X p satisfies 



(67) 



\X P - X p \\ 2 < C lS -^ 2 \\X v - Xg»% + C 2 e p 



with probability at least 1 — (3 where C\ and C 2 are absolute constants. 



Remark 1. In the small scene limit, the spatial frequency range (48) and (66) imply the 
resolution limit 

c 



2 P > 



4(u + aiT) 



which is the same as (53). 



5.4. High frequency /narrow band limit. The preceding sampling scheme for SAR may 
be a bit cumbersome and unpractical. A far more practical sampling scheme is to indepen- 
dently select 9k, k = l,...,mi independently from [0, 2w] and v\,l = l,...,m 2 from (48). 
We also want to obey the spatial frequency constraint (48) with ot\T <C ujq. In particular 
we assume that look angles 9^ are i.i.d. with the probability density function f(9). In what 
follows we shall consider the high frequency limit and let 



(68) 



> 1, v Q = 2u /c 



and a{F be fixed. We will consider on-grid targets only. We prove in Appendix [C] the 
following coherence bound. 



5,K>0. 



Lemma 4. Suppose 

(69) n < 

o 

Then the sensing matrix satisfies the coherence bound 

V2K 



(70) 



fi < /i 



with probability greater than (1 — 5) 2 where in general x 1 (resp. \ s ) satisfies the bound 
(71) /2< c 7 (l + zV)- 1/2 ' 



7,00 



where 



1 7)°o 



is the Holder norm of order 7 > 1/2 and the constant c 7 depends only on 7. 



Note that the above coherence bound does not require full, circular view of the scene, 
but the smoothness of the sampling density function / which depends indirectly on the size 
of the support of /. Lemma [4] in conjunction with Proposition [2] yield an error bound for 
BPDN which depends entirely on the sampling angles and high base frequency and thus is 
a very crude estimate. 

To see how spatial frequency sampling can improve the coherence bound, consider the 
case of circular SAR with randomly and uniformly distributed looks on [0,27r]. Then the 
argument in Appendix [O shows that 



(72) 



max ■ 



T712 



mi 

£■ 

k=l 



-27rifi^d fc -(p-p') 
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^ ma 

max — > J (2 
p^p' m 2 ^ 



Tit 



IP 



P'i) 



where the zeroth order Bessel function J has the large-argument asymptotic 

Y 




(73) J (z) = \ — {cos (z - 7i/ A) + 0(l/\z\)} , z > 1. 

TTZ 



On the one hand, the worst case bound is 
(74) Jo(27r^|p-p , |) < 



Vp^P, Vj 



for some c > 0. On the other hand, if the standard resolution criterion 
(75) £aiT/c > 1/2 



is satisfied, then the argument 2ir£uj\p — p'| ranges over at least one period, with v~ in (48) 



and p, p' G £, and one can deliberately select a sequence of spatial frequencies to minimize 



the right hand side of (72) in view of the sinusoidal nature of the leading asymptotic in 



(73). In principle, the leading order term in (73) can be made to cancel out which yields the 



improved bound 

Likewise, the fluctuation part of /i, which is bounded by the second term of the right hand 



side of (70), can also be reduced by an irregular sampling of spatial frequencies. 



More generally, let / be represented by the Fourier series 



/(*) 



i 



Then 



max27ry a*e i ^p'-P+ 7r / 2 ) J z (2tt^,) 
p^p' Y 



where Ji is the Bessel function of order I and P >_ P the angle of p' — p (cf. (91)). The large 
argument asymptotic for J\ 



(76) 



JAz) = J— {cos (z - ln/2 - tt/4) + OG^ 1 )} > z > 1 

V 7TZ 



suggests a similar reduction of /x under (75) by judicious choice of spatial frequencies. The 
l^l -1 / 2 decay of the asymptotic (76) indicates the spatial frequencies should weigh (slightly) 



more heavily on large spatial frequencies. 



6. Numerical examples 



In the following simulations, we use YALL1 27 to solve BPDN. BPDN, however, does 



not take advantage of the common support of X and X'. The off-grid perturbations 
are i.i.d. uniform random variables in [— 0.4Ar, 0.4Ar]. 

In the first set of simulations, we test the case of immobile targets which produce noiseless 
signals according to (12) with the parameters m = 64, n = 128, SNR = 100, and Q = 2. 



Fig. [T] shows the results of BPDN reconstruction of 10 well separated targets. Reconstruc- 
tion based on eq. (p|-(10) is clearly superior to that based on eq. ( 13 )-( 15 ). Fig. |2| shows 



an even greater improvement when the targets are closely spaced. 
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|o(t)|, off-grid fixing off, m=64, n=128, s=10, SNR=100, u-0.31 19 



|o(-r)|, off-grid fixing on, m-64, n-128, s-10, SNR-100, u-0.42943 



50 100 



Figure 1. BPDN reconstruction (in blue) of off-grid targets with (left) eq. 
([8])-( 10) and (right) eq. ( 13 )-( 15 ). Red circles represent the exact values. 



|o(t)|, off-grid fixing off, m-64, n=128, s=2, SNR=100, n-0.27262 



(t)|, off-grid fixing on, m=64, n-128, s=2, SNR=100, n=0.36896 




-100 -50 50 




Figure 2. BPDN reconstruction (in blue) of closely spaced targets separated 
by 0.2Ar with (left) eq. ([8])-(10) and (right) eq. ( 13 )-( 15 ). Red circles repre- 
sent the exact values. 



For the second set of simulations, we test the case of moving targets. The parameters are 
chosen as m = 150, n = 625, s = 10, SNR = 100, and Q = 2. 



Fig. 3 shows significant improvement of reconstruction based on eq. (28)-(30) over that 



based on eq. (25)-(27). 

In Figure [4] and |5J we present numerical comparisons between LC and QC with the following 
parameters 

• For LC, At = 1, Am = 1/m, m = n = 47, T = 1/Aw = m, and the corresponding 
bandwidth is B = 47. 

• For QC, At = 1, Au = 1/m, m = n = 47, At = At, T = nAt. By setting a x = 0, 
the corresponding bandwidth is B = 47. 

In both cases we set the carrier frequency uq = 1000. In addition to equally-spaced sampling 



in (34) we also consider random sampling with QC as in the case of LC. 

16 



« ** *> 

5 «< x 



-600 -400 



200 400 



-600 -400 -200 



Range-Doppler Image, off-grid fixing off. 
m=150, n=625 : s=10, SNR=100, |.i=0.19756 



Range-Doppler Image, off-grid fixing on. 
m=150, n=625, s=10 : SNR=100, ^=0.49122 
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Figure 3. BPDN reconstruction of \x k ,i\ with (left) eq. (25)-(27) and (right) 



eq. (28)-(30). The top plots are the combined vector view and the bottom are 



the range-Doppler view. 



In addition to BPDN, the greedy algorithm, Subspace Pursuit (SP) [9], is also used for 
reconstruction as the greedy algorithm tends to be much faster than solving BPDN. More- 
over, one can easily modify the SP algorithm to take advantage of the prior information 
suppX = suppX' in the case of off-grid targets (see (14) & (29)). Two definitions of "suc- 
cess" in recovery are used: the first one is defined by the relative mean square error (MSE) 
less than 0.1 and the second one is defined by exact recovery of the target support. 

Fig. [I] and [5] show essentially no difference in performance between LC and QC as long 
as their bandwidth is the same. There is also no difference in performance between equally- 
spaced sampling and random sampling (labelled as "randt" in Fig. |4j [5]) with QC. Fig. [4] 
shows that SP slightly outperforms BPDN (as implemented by YALL1). 

For the case of localized extended targets, we use the 250 x 200 aerial image of Boston's 
Logan airport (Fig|6]left). The image with 19757 nonzero pixels is not strictly sparse but may 
be considered compressible. For the sake of speed we use SP with sparsity parameters much 
smaller than the actual sparsity, 19757, of the scene. The reconstruction with m = 10000 
and sparsity parameter 2000 captures main features of the scene (Fig [6] middle). When the 
number of data increases to m — 15000 the reconstruction with sparsity parameter 4000 
shows more fine-scale details (Fig [6] right). 
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Figure 4. Success probabilities of (left) BPDN and (right) SP reconstructions 
with 1% noise. 




Figure 5. Success probabilities of SP reconstruction with (left) 5% and the 
full data m = 47 noise and (right) 1% noise (right) and an half data m = 23. 
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Figure 6. The 250 x 200 scene (left) and the SP reconstruction with (middle) 
m = 10000 and (right) m = 15000. 



For the case of distributed targets, we simulate the data with the reflectivity function 
which depends only on one variable p(ri) and has a sparse representation in the Littlewood- 
Paley basis. Figure [7] shows the result of reconstruction with the following parameters: 

=2,n p = 100, Vp; for p = -2, -1, 0, 1, 2, s p = 12, 24, 13, 24, 23, m p = 36, 64, 36, 64, 64, e = 
0.5134,1.4849,0.6520,1.0274,1.3681 equivalent to the relative noise level = 0.0500, 0.0649, 
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-5 -4 -3 -2 -1 1 2 3 4 5 

Figure 7. Imaging of a continuous scene in the presence of roughly 6% 
noise: The red-solid curve (top) is the exact profile and the blue * shows the 



reconstructed profile. 



0.0494,0.0640,0.0698. The resulting relative errors of reconstruction are 0.1941, 0.1240, 
0.2498,0.2361, 0.1502. 

7. Conclusion 

We have explored compressed sensing approach to monostatic radar as well as spotlight 
SAR with chirped signal. In particular, we have proposed a simple method for dealing with 
off-grid targets and established a performance guarantee for the method. 

We have extended our approach to the spotlight SAR in the tomographic formulation. We 
have proposed sampling schemes to deal with several classes of sparse targets under various 
bandwidth constraints and showed that the CS schemes can achieve standard resolution 
limits with sparse sampling. Our numerical experiments are consistent with the theoretical 
results. 



Appendix A. Proof of Lemma [T] 

Proof. We prove the coherence bound for the matrix A = [F G]. 
The k-th column vector of A is given by 

3k ~ \ (tj - 1 /2)e- 2 ^ k -n)% ? k>n _ 

Note that ||Ajt|| 2 = m, k < n and E[||Afc|| 2 ] = j|, k > n. Consequently, the scalar product of 
two distinct columns of A has three possible forms: 

in 

bkk' = ex P [ 2ni Q( k ~ k %] > a ( f j) = !> (5 - V 2 )> or (5 - V2) 2 , 

i=l 19 



for k, k' — 1, . . . , n. When a(tj) = 1 or (tj — 1/2) 2 both columns are drawn from F or G and 
thus k 7^ k'. When a(tj) = tj — 1/2, one column is drawn from F and the other from G. In 
the last case, k and k! are arbitrary. 

Let S m = YOjLi u j, T m = EJLi Vj where 

Uj = a(tj) cos [2nQ(k - k%] , Vj = a(tj) sin [2nQ(k - k%] 

are independent (for different j) random variables in [—1, 1]. We have 

\bkk'\ < \hk> -^(hk') \ + \^(hk')\ 

= | S m + \T m - ES m - iET m | + \E(S m + \T m ) \ . 

Recall the Hoeffding inequality. 

Proposition 4. Let U\, . . . , U m be independent random variables, and S m = YlJLi Uj- As- 
sume that Uj G [u, v], j — 1, 2, . . . , m almost surely, then we have 

(77) P(] S m - ES m \>mt)<2 exp 

for all positive t. 

Choosing t = Kj y/m for some constant K, we have 

F(\S m - ES m \ > y/mK) < 2 exp [-K 2 /2] . 

Note that the quantities S m depend on k — k' but there are at most n — \ different values. 
The union bound yields 

r max \S m - ES m \ > y/mK) < 2(n - 1) exp \-K 2 /2] , 

and similarly 

P(max \T m - ET m \ > s/m\K) < 2{n - 1) exp \-K 2 /2] . 

ky^k' 

We have 

P(max | b k k' - E&fcfc/ 1 < V2mK) 

ky^k' 

= P(max | S m + \T m - ES m - iET m \ < \/2^nK) 

ky^k' 

> (l -2(n- I) exp [~K 2 /2] > (1 - 5) 2 
if 5 > 2nexp \-K 2 /2}. 

Now let us estimate the mean E(bkk') or K(S m +iT m ) for k / k' . Note that tj, j — 1, . . . , m, 
are independently and uniformly distributed in [0, 1]. We have three different cases: 

(1) For a(tj) = 1, 

E(b kk >) =m f e 2 ^ h - k ^ dt = 0. 
Jo 
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2m 2 t 2 
2~2j(v-u) 2 



(2) For a(t 3 ) = t 3 - 1/2 
■l 



E(b kk >) = m 
and thus 



(t - l/2)e 27riQ ^- k '^dt = me ^Q(k-k')Szll k ± fc / 

2-ni{k — k')Q 



m 



On the other hand, 



\®(b w )\<-—, kjtk' 



E(b kk ) = m ! (t - 1/2) • 1 dt = 0. 
Jo 

(3) For a(tj) = g - 1/2) 2 , 

E(6 fcJfc ,) = m / (t - 1 /2) 2 e 2n[Q ^- k,)t dt = me niQ(k - k ' \ { - -— , fc ^ fc' 



(2tt(A; - A;')<2) ; 



and thus 



For fc, fc' < n, since ||^4fc|| 2 = m, 



|E(6 fcfc /)| < 



m 



(7* 



(2ttQ) 2 ' 



C y2K 
bk k > < — V2mK = C — k.k' < n 



'kk' — v miuj-i. — w . — , .u, 

m Jm 



for some universal constant C, with probability greater than (1 — 5) 2 . 
On the other hand, for k > n, 



\M\l = £fe - V2) s 



which is a sum of m i.i.d. random variables of mean 1/12 on [0, 1/4]. Applying Hoeffding 
inequality with t = 1/24, we have 



P (|||^ fc ||2_^| > ^\ < 2e - m / 18 



12' ~ 24 



and thus 



¥ (\\ Ak \\l<— \ <2e- m / 18 . 



We conclude from these observations that 



(79) 



(80) 



buy < 



Ofcfc' < 



C 
m 

C 
m 



'2mK + 



'2mK + 



m 



2nQ 



m 



(2vrQ) 2 J 
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C ■ 



V2K 1 
+ 



C- 



m 2%Q 
\[2K 

+ 



, k <n < k' 



m (2nQ) 2 



, k,k' > n 



with probability at least (1 - 5) 2 - 4 e - m/18 . (78)- (80) are what we set out to prove. 



□ 
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Appendix B. Proof of Lemma 
Proof. Observe that AA* = FF* + GG* with 



(FF* 

(GG* 
Thus 

(81) 



'>■.) 



^wiQitj-t^k _ 2mQ(tj-ti) 



I _ e 2mQ(tj-U) 



E' 

k=l 

n i 2mQ(t—t)n 

£fi - l/2)(5 - l/2)e 2 ^-^ k = (U - l/2)(5 - i/ 2 )e 2 ^fe-^ J^—^L±. , 



fe=i 



(AA*) y | = (l + (ti-l/2)(5-l/2)) 



sin [irQ(tj — ti)n] 



sin [7r(5(ij -U)] 



which can be controlled if the random variables Zij = Q(tj — ti) are bounded away from 
integers. Zy has the density 

fz = I[0,Q] * I[0,Q] 

since ti,tj are independently and uniformly distributed in [0, 1]. Clearly fz is a triangular 
function on [0, 2Q] and satisfies \fz\ < 1/Q- Let 

C = min min { I — k 1 1 . 

Hence, for small 6 > 0, 

F{(>b) > (l-dfo) m - (m ^ 

where the power counts for all possible pairs (ti,tj), 1 < z ^ j < m. Thus, with probability 
greater than (1 — Ci5) m '( m_1 ) we have 



(AA* 



'1,3 



< -r, i ^ J. 
no 



By choosing 6 



2(m-l) 



ii- 
2 



and applying Gershgorin circle theorem, we have || AA* — nl m || 2 < n, 



or equivalently A L < 2n. 



□ 



Appendix C. Proof of Lemma [3] 



Proof. Recall Weyl's theorem (a generalized version of Gauss sum) |6j, 23 
Proposition 5. Let r e N, r > 2, and the prime number q > r. Then 



exp \2ix\{a r k r + a r -ik r ~ 1 + • • ■ + a 2 k 2 + a\k)jq 

k=l 



< (r-l)y/q 



provided aj G N for all j = 1,2, ... , r and (ai, a 2 , . . . , a r ) 7^ (0, 0, . . . , 0) mod g. 

Assume also = kAr and ui = (I — n/2)Au. The inner-product of the p-th and the p'-th 
columns of F is then 



i2) 



m 

F j, P ') = Y] ex p ~ 27ri ( — + — j ) > p ^ p' 
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where 



a,2 = a2mAt 2 Ar(k — k') 

a x = -a 2 mAtAT 2 (k 2 - k' 2 ) + axinAtAr^k - k') + mAtAu(l - I') 



To ensure that a 2 G N, we set 

a 2 = 



1 



mArAt 2 



and, similarly, for a\ G N we set 

a i 
Au 

Furthermore setting 



1 



mAtAr 
1 



mAt 



(or its integer multiple) 



(or its integer multiple) 
or its integer multiple). 



m — n. At = At 



nAu 



we obtain 



a 2 = k - k', ai = -(k 2 - k' 2 ) + (k — k') + I') 

which do not have m as a common factor since \k — k'\ < n — 1. The sensing matrix F in 
(34) becomes 



(83) 



hV 



exp 



-2vri ( k— + (-k 2 + k + l)- 
n n 



If we set ai = 0, the resulting sequence is called Alltop sequence 19 of bandwidth B = n/ At 
and the resulting sensing matrix becomes 



F jtP = exp 



2ni k— + {-k 2 + /)- 



n 



n 



To apply Weyl's theorem, we need only to check that (oi,a 2 ) ^ (0,0) mod n. Since 
\k — k'\ , \ l — l'\ < n — 1, a 2 = mod n if and only if k = k', which implies a\ — if and 
only if I = I'. This can not happen if p ^ p' . By Weyl's theorem, 



implying 



//(F) = max 



( F j^ F j, P ') < Vn. 
(F pj F p/ ) 



p¥=p' \\F p \\\\Fpi 
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< 



n 



□ 



Appendix D. Proof of Lemma [4] 
Proof. The mutual coherence has the form 

1 



*5) 



max 



m,2 mi 



e -27ri&/jdjfc.(p-p') 

j=i fc=i 



p^p' mim 2 

Consider the first summation over k = 1, ...,p. Let 

P k = cos (2ir£ujd k ■ (p - p')), Qk = sin (27r£z/ j d fc • (p - p')) 

and 

mi mi 



k=l 



k=l 



Then the summation can be bounded by 



16) 



nil 



e -27ri%d A -(p-p') 



fc=i 



< V\S mi -E5 mi | 2 + |T mi -ET mi |2 + v/|ES mi | 2 + |ET mi |2 



We apply the Hoeffding inequality to both S mi and T mi . With 

t = K/y/mi, K > 

we obtain 

(87) P[mr 1 |S , mi-lS'mil>^/V^i] < 2e ' R2/2 

(88) P [m^ 1 \T mi - ET mi \ > K/^[] < 2e~ K2/2 . 



Note that the quantities S mi , T mi depend on p - p' but they possess the symmetry: 
<3mi(p — p') — S mi (p' — p),T mi (p — p') = — T mi (p' — p). Furthermore, a moment of reflection 
reveals that thanks to the square symmetry of the lattice there are at most n — 1 different 
values \S mi \ and |T mi | among the n(n — l)/2 pairs of (p, p')- 

We use (87)- (88) and the union bound to obtain 



P 
P 



max m 1 1 \S mi — E5 mi | > if/ ^Jrn[ 
maxmj" 1 |T mi — ET mi | > K/y/mi 



< 2{n - 1) ■ e'^ 2 / 2 

< 2(n - 1) ■ e~ K2 ' 2 



where the factor An is due to the structure of square lattice. Hence, by (|86|) 

mi 



P 



ma^r 1 



~y ^ e -27rifajd fc '(p-p / ) _ jg> 



fe=l 



mi 



i^d fc -(p-p') 



,fc=l 



< v^if/v^ 



> (l-2(n-l)e^ 2 / 2 ) 2 . 



By (69) the right hand side of (89) is greater than (1 — 6) . 
Consider the identity 



(90) — E 
mi 



mi 



y^ e -27rifa J d fc -(p-pQ 



k=l 



2t 



e- 27Tie ^ p - p,) f(9)d9, d = (cos 9, sin 0). 
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Expanding / in the Fourier series 



and denoting the angle of p' — p by p *- p we can write (90) as 



2- 



e iZ0 e 27ri&/,-|p-p'| cosB^g 



(91) 



2vr aze" (e p'-P+-/2) j,( 27r ^. 



where J/ is the Bessel function of order I. The large argument asymptotic for J\ 
J t (z) = \ — {cos (z ~ In /2 - 71 /A) + Odz]- 1 )} , z»l 

V 7T2 



suggests the decay estimate (71). 



Alternatively and more directly, the asymptotic of (90) can be derived by the method of 
stationary phase (Theorem XL 14 and XL 15 of [26]). 



Proposition 6. Let g PtP >(9) = d • (p — p')/|p — p'| which is in C°°([— it, n]), Vp, p' G C. 
(i) Suppose ^g P , P '(dj ^ 0, V0 G [a, b], Vp, p' G C. Then for all f G C#([a, b}) 



(92) 



-27ri^|p-p'| W (<?) 



f{d)dd 



< c^i + wip-p'i) 



l\\-h 



h,oo 



for some constant Ch independent of f . Moreover, since {g p , p ' '■ p, p' G C} is a compact 
subset of C h+1 ([a, b]), the constant Ch can be chosen uniformly for all p, p' G C. 

(ii) Suppose ^g P:P '(6) vanishes at6* G (a, b). Since J^<? p , p '(0*) 7^ ; i/iere erases a constant 
c t , t > 1/2 suc/i i/iai 



(93) 



-2bii&'j[p-p'|fl D . /W 



^cKi+^p-p'ir 1 / 2 



t,oo 



where the constant c t is independent o/p, p' G £. 



Combining the estimates for (90) using (89) and the identity 



UV = {U - U){V -V) + U{V -V) + V(U -U) + UV 



we obtain (70) with probability greater than (1 — 5) 2 . 



□ 
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